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Abstract 

A topological approach to stratification learning is developed for point cloud 
data drawn from a stratified space. Given such data, our objective is to infer which 
points belong to the same strata. First we define a multi-scale notion of a stratified 
space, giving a stratification for each radius level. We then use methods derived 
from kernel and cokernel persistent homology to cluster the data points into differ- 
ent strata, and we prove a result which guarantees the correctness of our clustering, 
given certain topological conditions; some geometric intuition for these topologi- 
cal conditions is also provided. Our correctness result is then given a probabilistic 
flavor: we give bounds on the minimum number of sample points required to in- 
fer, with probability, which points belong to the same strata. Finally, we give an 
explicit algorithm for the clustering, prove its correctness, and apply it to some 
simulated data. 

1 Introduction 

Manifold learning is a basic problem in geometry, topology, and statistical infer- 
ence that has received a great deal of attention. The basic idea is as follows: given 
a point cloud of data sampled from a manifold in an ambient space R k , infer the 
underlying manifold. A limitation of the problem statement is that it does not ap- 
ply to sets that are not manifolds. For example, we may consider the more general 
class of stratified spaces that can be decomposed into strata, which are manifolds 
of varying dimension, each of which fit together in some uniform way inside the 
higher dimensional space. 

In this paper, we study the following problem in stratification learning: given 
a point cloud sampled from a stratified space, how do we cluster the points so that 
points in the same cluster are in the same stratum, while points in different clusters 
are not? Intuitively, the strategy should be clear: two points belong in the same 
stratum if they "look the same locally," meaning that they have identical neighbor- 
hoods, within the larger space, at some very small scale. However, the notion of 
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"local" becomes unclear in the context of sampling uncertainty, since everything 
becomes quite noisy at vanishingly small scale. In response, we introduce a radius 
parameter r and define a notion of local equivalence at each such r. 

Our tools are derived from algebraic topology. In particular, we define local 
equivalence between points via maps between relative homology groups, and we 
then attempt to infer this relation by using ideas coming from persistent homology 

ED 

Prior Work Consistency in manifold learning has often been recast as a homol- 
ogy inference statement: as the number of points in a point cloud goes to infinity, 
the inferred homology converges to the true homology of the underlying space. 
Results of this nature have been given for manifolds |28 29 1 and a large class of 
compact subsets of Euclidean space (6). Stronger results in homology inference 
for closed subsets of a metric space are given in 1 111 . 

Geometric approaches to stratification inference have been developed. These 
include inference of a mixture of linear subspaces | 25 1, mixture models for general 
stratified spaces [ 2 1 1 , and generalized Principal Component Analysis (GPC A) [ 32 1 
which was developed for dimension reduction for mixtures of manifolds. 

The study of stratified spaces has long been a focus of pure mathematics; see, 
for example, 11911331 . The problem of inference for the local homology groups of 
a sampled stratified space in a deterministic setting has been addressed in (3] . 

Results In this paper we propose an approach to stratification inference based 
on local homology inference; more specifically, based on inference of the kernels 
and cokernels of several maps between groups closely related to the multi-scale lo- 
cal homology groups for different pairs of points in the sample. The results in this 
paper are: (1) a topological definition of two points belonging to the same strata by 
assessing the multi-scale local structure of the points through kernel and cokernel 
persistent homology; (2) topological conditions on the point sample under which 
the topological characterization holds - we call this topological inference; (3) a 
geometric intuition of these topological conditions based on quantities related to 
reach and to the gradient of a distance function; (4) finite sample bounds for the 
minimum number of points in the sample required to state with high probabil- 
ity which points belong to the same strata; (5) an algorithm that computes which 
points belong to the same strata and a proof of correctness for some parts of this 
algorithm. 

Outline We review the needed background in Section [5] In Section [5] we give 
the topological inference theorem and provide some geometric intuition. The prob- 
abilistic statements are provided in Section]?] We describe the clustering algorithm 
in Section [5] the correctness proof of the algorithm is contained in three Appen- 
dices, [AjthroughjC] The main body of the paper closes with some further discus- 
sion in Section|7] 

2 Background 

We review necessary background on persistent homology and stratified spaces. 
The treatment of the former here is mostly adapted from [5], although we present 
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Figure 1 : The vector v is born at level i and then it dies at level j. 



the material in slightly simplified form. We first describe general persistence mod- 
ules, focusing mainly on those that arise from maps between homology groups 
induced by inclusions of topological spaces. We then discuss stratifications and 
their connection to the local homology groups of a topological space. Basics on 
homology itself are assumed; for a readable background, see 1271 or 1221 . or 1151 
for a more computationally oriented treatment. 

2.1 Persistence Modules 

In 0, the authors define persistence modules over an arbitrary commutative ring 
R with unity. For simplicity, we restrict immediately to the case R = Z/2Z. Let 
A be some subset of R. Then a persistence module Ta is a collection {F a } a £A 
of Z/2Z-vector spaces, together with a family {/f : F a — > Fp} a <p,zA of linear 
maps such that a < [3 < 7 implies /J = fl o f&. We will assume that the index 
set A is either M or R>o and not explicitly state indices unless necessary. 

A real number a is said to be a regular value of the persistence module T 
if there exists some e > such that the map f™tg is an isomorphisms for each 
S < e. Otherwise we say that a is a critical value of the persistence module; if 
A — R>o, then a = will always be considered to be a critical value. We say that 
T is tame if it has a finite number of critical values and if all the vector spaces F a 
are of finite rank. Any tame R>o-module T must have a smallest non-zero critical 
value p[Ty, we call this number the feature size of the persistence module. 

Assume T is tame and so we have a finite ordered list of critical values = 
Co < ci < . . . < c m . We choose regular values {a;}™l such that c;_i < a;_i < 
c; < at for all 1 < i < m, and we adopt the shorthand notation Fi = F ai and 
fl : Fi — > Fj, for < i < j < m. A vector v € Ti is said to be born at 
level i if v ^ im.fi— 1, an d such a vector dies at level j if //(«) G im/jLj but 
fl ( u ) i- i m fi~i- This is illustrated in Figurejl] We then define P' ,J to be the 
vector space of vectors that are born at level i and then subsequently die at level j, 
and p 1 ' 1 denotes its rank. 

2.1.1 Persistence Diagrams 

The information contained within a tame module T is often compactly represented 
by a persistence diagram, Dgm(J r ). This diagram is a multi-set of points in the 
extended plane. It contains j3 1 ' 3 copies of the points (a,Cj), as well as infinitely 
many copies of each point along the major diagonal y — x. In Figure [5] the per- 
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F a - C — ► Fp +e F a+e F fi+e 




Figure 2: Commuting diagrams for strongly interleaving persistence modules. 

sistence diagrams for a curve and a point cloud sampled from it are displayed; see 
Section [2T2] for a full explanation of this figure. 

For any two points u — (x,y) and it' = (x',y') in the extended plane, we 
define | u — u' \ \ oo = max{ \x — x' | , | y — y' | } . We define the bottleneck distance 
between any two persistence diagrams D and D' to be: 

d B {D,D')= inf sup | \u — Y(u)\ |oo, 
r-.D-tD' ueI3 

where F ranges over all bijections from D to D' . Under certain conditions which 
we now describe, persistence diagrams will be stable under the bottleneck distance. 

Two persistence modules T and Q are said to be strongly ^-interleaved if, for 
some positive e, there exist two families {£ Q : F a — > G a +e} a and {%p a : G a — > 
F a + C } of linear maps which commute with the module maps {/„} and {gf } in 
the appropriate manner. More precisely, we require that, for each a < f3, the four 
diagrams in Figure|2]all commute. 

We can now state the diagram stability result (|5|, Theorem 4.4), that we will 
need later in this paper. 

Theorem 2.1 (Diagram Stability Theorem) Let J" and Q be tame persistence 
modules and e > 0. If J- and Q are strongly ^.-interleaved, then 

dfl(Dgm(JO,Dgni(j?)) < e. 



When we wish to compute the persistence diagram associated to a module _F, 
it is often convenient to substitute another module Q, usually one defined in terms 
of simplicial complexes or other computable objects. The following theorem (| 15 1, 
p. 159) gives a condition under which this is possible. 

Theorem 2.2 (Persistence Equivalence Theorem) Given two persistence mod- 
ules J- and Q, suppose there exist for each a isomorphisms F a = G a which 
commute with the module maps, then Dgm(J r ) = Dgm(CJ). 

2.1.2 (Co)Kernel Modules 

Suppose now that we have two persistence modules T and Q along with a family 
of maps {<f> a : F a — > G a } which commute with the module maps - for every pair 
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Figure 3: Illustration of a point cloud and its persistence diagram. Top: X is the curve embedded 
as shown in the plane and U is the point cloud. Bottom left: the persistence diagram Dgnij (d%); 
Bottom right: the persistence diagram Dgnij (du). 



a < f3, we have o <f> a — <f>p o /f . In other words, every square commutes in 
the diagram below: 

f 13 

. . . ->F a ^ Fp ->• . . . 

i <l>a i<t>/3 

. . . — >G a — > Gp — ¥ . . . 

Then, for each pair of real numbers a < /?, the restriction of to ker <f> a maps 
into ker^, giving rise to a new kernel persistence module, with persistence dia- 
gram denoted by Dgm(ker (j>). Similarly, we obtain a cokernel persistence mod- 
ule, with diagram Dgm(cok (f>). 



2.2 Homology 

Our main examples of persistence modules all come from homology groups, either 
absolute or relative, and the various maps between them. Homology persistence 
modules can arise from families of topological spaces {X a }, along with inclusions 
X a <—t X^ for all a < p. Whenever we have such a family, the inclusions induce 
maps HjfXot) — > Hj(X^), for each homological dimension j > 0, and hence 
we have persistence modules for each j. Denning H(X Q ) = (J)^ H 3 (X a ) and 
taking direct sums of maps in the obvious way, will also give one large direct-sum 
persistence module {H(X a )}. 

2.2.1 Distance Functions 

Here, the families of topological spaces will be produced by the sublevel sets of 
distance functions. Given a topological space X embedded in some Euclidean 
space R , we define dx as the distance function which maps each point in the 
ambient space to the distance from its closest point in X. More formally, for each 
y £ M. N , d%(y) = inf^gx dist (x, y). We let X a denote the sublevel set d^ 1 [0, a]; 
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Figure 4: Left: The space X is in solid line and the closed ball B has dotted boundary. Right: 
the persistence diagram for the module {Hi(X a n B, X a n dB)}. 

each sublevel set should be thought of as a thickening of X within the ambient 
space. Increasing the thickening parameter produces a growing family of sublevel 
sets, giving rise to the persistence module {H(X a )} a git>o; we denote the persis- 
tence diagram of this module by Dgm(dx) and use Dgm^dx) for the diagrams 
of the individual modules for each homological dimension j. 

In Figure [3] we see an example of such an X embedded in the plane, along 
with the persistence diagram Dgm 1 (dx). We also have the persistence diagram 
Dgm^ (du), where U is a dense point sample of X. Note that the two diagrams are 
quite close in bottleneck distance. Indeed, the difference between the two diagrams 
will always be upper-bounded by the Hausdorff distance between the space and its 
sample; this follows from Theorem |2.1| 

Persistence modules of relative homology groups also arise from families of 
pairs of spaces, as the next example shows. Referring to the left part of Figure]?] 
we let X be the space drawn in solid lines and B the closed ball whose boundary is 
drawn as a dotted circle. By restricting dx to B and also to dB, we produce pairs 
of sublevel sets (X a n B, X Q n dB). Using the maps induced by the inclusions of 
pairs, we obtain the persistence module {H(X Q n B, X Q D dB)} a ^ >0 of relative 
homology groups. The persistence diagram, for homological dimension 1, appears 
on the right half of Figure]?] 

2.3 Stratified Spaces 

We assume that we have a topological space X embedded in some Euclidean space 
M. N . A (purely) d- dimensional stratification of X is a decreasing sequence of 
closed subspaces 

X = X d 3X d _i D ...X D X_i = 0, 

such that for each i, the i-dimensional stratum Sj = X; — Xi_i is a (possibly 
empty) i-manifold. The connected components of Si are called i-dimensional 
pieces. This is illustrated in Figure|5] where the space X is a pinched torus with a 
spanning disc stretched across the hole. 

One usually also imposes a requirement to ensure that the various pieces fit 
together uniformly. There are a number of different ways this can be done (see 
1 23 1 for an extensive survey). For example, one might assume that for each a; £ Si, 
there exists a small enough neighborhood N(x) C X and a (d— i — 1) -dimensional 
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Figure 5: The coarsest stratification of a pinched torus with a spanning disc stretched across the 
hole. 




Figure 6: The cones c(L x ) and c(L y ), where x and y are respectively in the 0-stratum and the 
1-stratum, are highlighted. 

stratified space L x such that jV(a;) is stratum-preserving homeomorphic to the 
product of an i-ball and the cone on L x ; one can then show that the space L x 
depends only on the particular piece containing x. This definition is illustrated in 
Figure [6] 

Since the topology on X is that inherited from the ambient space, this neigh- 
borhood N(x) will take the form X n B r (x), where B r (x) is a small enough ball 
around x in the ambient space. 

We note that the above definition requires all strata to be contained within 
the closure of the top-dimensional stratum. It is also possible, of course, to have 
spaces where this is not the case: for example, a two-dimensional plane that has 
been punctured by a line. In this case, a slight adjustment to the above definitions 
can be made in order to impose similar notions of uniformity. 

2.3.1 Local Homology and Homology Stratifications 

Recall ([27 1) that the local homology groups of a space X at a point a; £ X are 
the groups Hi(X, X — a;) in each homological dimension i. If X happens to be 
a d-manifold, or if x is simply a point in the top-dimensional stratum of a d- 
dimensional stratification, then these groups are rank one in dimension d and trivial 
in all other dimensions. On the other hand, the local homology groups for lower- 
stratum points can be more interesting; for example if x is the crossing point in 
FigurejT] then Hi (X, X — x) has rank three. 

If x and y are close enough points in a particular piece of the same stra- 
tum, then there is a natural isomorphism between their local homology groups 
H(X, X — x) = H(X,X — y), which can be understood in the following man- 
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ner. Taking a small enough radius r and using excision, we see that the two local 
homology groups in question are in fact just H(X n _B r (a;),X n dB r (x)) and 
H(X n B r (y),X n dB r (y)). Both of these groups will then map, via intersection 
of chains, isomorphically into the group H(XnB r (3;)n-Br(j/), d(B r (x)nB r (y)), 
and the isomorphism above is then derived from these two maps. See the points in 
Figure|7]for an illustration of this idea. 

In 1 31 1, the authors define the concept of a homology stratification of a space 
X. Briefly, they require a decomposition of X into pieces such that the locally 
homology groups are locally constant across each piece; more precisely, that the 
maps discussed above be isomorphisms for each pair of close enough points in 
each piece. 

3 Topological Inference Theorem 

From the discussion above, it is easy to see that any stratification of a topological 
space will also be a homology stratification. The converse is unfortunately false. 
However, we can build a useful analytical tool based on the contrapositive: given 
two points in a point cloud, we can hope to state, based on their local homology 
groups and the maps between them, that the two points should not be placed in 
the same piece of any stratification. To do this, we first adapt the definition of 
these local homology maps into a more multi-scale and robust framework. More 
specifically, we introduce a radius parameter r and a notion of local equivalence, 
~ r , which allows us to group the points of X, as well as of the ambient space, into 
strata at this radius scale. We then give the main result of this section: topological 
conditions under which the point cloud U can be used to infer the strata at different 
radius scales. 

3.1 Local Equivalence 

We assume that we are given some topological space X embedded in some Eu- 
clidean space in M. N . For each radius r > 0, and for each pair of points p,q € K", 
we define the following homology map 4> x (p, q, r): 

H (X n B r (p) , X n dB r (p) ) -> H (X n B r (p) n B T (g) , x n d{B r (p) n B r (g))). 

(1) 

Intuitively, this map can be understood as taking a chain, throwing away the parts 
that lie outside the smaller range, and then modding out the new boundary. Alter- 
natively, one may think of it as being induced by a combination of inclusion and 
excision. A formal definition is given in Appendix A. 

Using these maps, we impose an equivalence relation on R . 

Definition 3.1 (Local equivalence) Two points x and y are said to have equiva- 
lent local structure at radius r, denoted x ~ r y, iff there exists a chain of points 
x — Xo,Xi,... ,x m = y from X such that, for each 1 < i < m, the maps 
(f^(xi-\, Xi, r) and (f) X (xi, Xi-i, r) are both isomorphisms. 

In other words, x and y have the same local structure at this radius iff they can 
be connected by a chain of points which are pairwise close enough and whose 
local homology groups at radius r map into each other via intersection. Different 
choices of r will of course lead to different equivalence classes. For example, 
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Figure 7: Left: x ~ r y, y oo r z. Right: the 1-dim persistence diagram, for the kernel of the map 
going from the z ball into its intersection with the y ball. A number, i.e., #2, labeling a point in 
the persistence diagram indicates its multiplicity. 



consider the space X drawn in the plane as shown in the left half of Figure [7] At 
the radius drawn, point z is equivalent to the cross point and is not equivalent to 
either the point x or y. Note that some points from the ambient space will now be 
considered equivalent to x and y, and some others will be equivalent to z. 

On the other hand, a smaller choice of radius would result in all three of x, y, 
and z belonging to the same equivalence class. 

3.1.1 (Co)Kernel Persistence 

In order to relate the point cloud U to the equivalence relation ~ r , we must first 
define a multi-scale version of the maps </!> x (p, q, r); we do so by gradually thick- 
ening the space X. Let dx : ^S. N — > K denote the function which maps each point 
in the ambient space to the distance from its closest point on X. For each a > 0, 
we define X a = 1 [0, a]. For each p, q, and r, we will consider the intersection 
map (fa{p, q, r), which is defined by substituting X Q for X in |TJ. Note of course 
that(p x (p,q,r) = ${p,q,r). 

For the moment, we fix a choice of p, q, and r, and we use the following 
shorthand: 

B p x (a)=X Q nB r (p), 
dB^(a) =X a lldB r ( P ), 
Bp q (a) =X a ll B r (p) n B r (q), 
dBf q (a) = X a n d{B r (p) n B r (q)). 

and we also often write Bp = B*(0) and Bp q = Bf q (0). By replacing X with U 
in this shorthand, we also write Bp (a) = U a H B r (p), and so forth. 

For any pair of non-negative real values a < j3 the inclusion X a *-¥ X^ gives 
rise to the following commutative diagram: 

H(B^(a),dB^(a)) ^ H(B^ q (a),dB^ q (a)) 
H(flJ(/9),0Bj(/?)) % H(Bp q (l3) , dBp q (/3)) (2) 
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Hence there are maps ker<f> a — > ker^ and cok<^ a — > coktjfp- Allow- 
ing a to increase from to oo gives rise to two persistence modules, {ker^J} 
and {cok</!> x }, with diagrams Dgm(ker0 x ) and Dgm(cok0 x ). Recall that a 
homomorphism is an isomorphism iff its kernel and cokernel are both zero. In 
our context then, the map X is an isomorphism iff neither Dgm(ker cj> x ) nor 
Dgm(cok (j) X ) contain any points on the y-axis above 0. 

Example. As shown in the left part of Figure|7] x, y and z are points sampled 
from a cross embedded in the plane. Taking r as drawn, we note that the right part 
of the figure displays Dgm 1 (ker X ), where <^ x = 4> x (z,y,r); we now explain 
this diagram in some detail. The group Hi (_B X , <9Bj) has rank three; as a possible 
basis we might take the three classes represented by the horizontal line across 
the ball, the vertical line across the ball, and the two short segments defining the 
northeast-facing right angle. Under the intersection map <j!> x = <^> x , the first of 
these classes maps to the generator of Hi {Bf y , dB^ y ), while the other two map to 
zero. Hence ker X has rank two. Both classes in this kernel eventually die, one 
at the q value which fills in the northeast corner of the larger ball, and the other at 
the a value which fills in the entire right half; these two values are the same here 
due to symmetry in the picture. At this value, the map X is an isomorphism and 
it remains so until the intersection of the two balls fills in completely. This gives 
birth to a new kernel class which subsequently dies when the larger ball finally fills 
in. The diagram Dgm 1 (ker<^) X ) thus contains three points; the leftmost two show 
that the map <j!> x is not an isomorphism. 

3.2 Inference Theorem 

Given a point cloud U sampled from X consider the following question: for a 
radius r, how can we infer whether or not any given pair of points in U has the 
same local structure at this radius? In this subsection, we prove a theorem which 
describes the circumstances under which we can make the above inference. Nat- 
urally, any inference will require that we use U to judge whether or not the maps 
(j> x (p, q, r) are isomorphisms. The basic idea is that if U is a dense enough sample 
of X, then the (co)kernel diagrams defined by U will be good enough approxima- 
tions of the diagrams defined by X. 

3.2.1 (Co)Kernel Stability 

Again we fix p, q, and r, and write X = X (p, q, r). For each a > 0, we let 
U a — d^lOja]. We consider 4>a = ^(p, g, r), defined by replacing X with 
U a in dTb. Running a from to oo, we obtain two more persistence modules, 
{ker </>j} and {cok0„ }, with diagrams Dgm(ker cj> u ) and Dgm(cok</> t/ ). 

If U is a dense enough sample of X, then the (co)kernel diagrams defined by 
U will be good approximations of the diagrams defined by X. More precisely, we 
have the following easy consequence of Theorem |2,l| 

Theorem 3.1 ((Co)Kernel Diagram Stability) The bottleneck distances between 
the (co)kernel diagrams of (f> u and X are upper-bounded by the Hausdorff dis- 
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tance between U and X: 

d s (Dgm(ker0 C7 ),Dgm(ker< ? !> x )) < d H {U,X), 
ds (Dgm (cok ^ Dgm (cok < d H (U,X). 

PROOF. We prove the first inequality; the proof of the second is identical. Put 
e — dn{U, X). Then, for each a > 0, the inclusions U a X n +j and X Q 
U a +e induce maps kert/^ — > ker</> x +e and ker</!> x — > ker<^ +E . These maps 
clearly commute with the module maps in the needed way, and hence we have the 
required e-interleaving and can thus appeal to Theorem |2.1| EH 



3.2.2 Main Inference Result 

We now suppose that we have a point sample U of a space X, where the Haus- 
dorff distance between the two is no more than some e; in this case, we call U an 
e-approximation of X. Given two points p,q £ U and a fixed radius r, we set 
<j/> x _ x (p ) g ] r ) 5 anc [ we wish to determine whether or not </> x is an isomorphism. 
Since we only have access to the point sample U, we instead compute the diagrams 
Dgm (ker <f> u ) and Dgm (cok (j) U );v/e provide an algorithm for doing this in Sec- 
tion [5] The main Theorem of this section, Theorem |3.2| gives conditions under 
which these diagrams enable us to answer the isomorphism question for </!> x . To 
state the theorem we first need some more definitions. 

Given any persistence diagram T>, which we recall is a multi-set of points in 
the extended plane, and two positive real numbers a < b, we let D (a, b) denote the 
intersection of T> with the portion of the extended plane which lies above y — b 
and to the left of a; = a; note that these points correspond to classes which are 
born no later than a and die no earlier than b. 

For a fixed choice of p, q, r, we consider the following two persistence mod- 
ules: {H(B x (a),<9B x )} and{H(.Bj 9 (»,aB x 9 )}. We let a(p, r) and a(p, q, r) 
denote their respective feature sizes and then set p{p, q, r) to their minimum. 

We now give the main theorem of this section, which states that we can use U 
to decide whether or not 4> x (p, q, r) is an isomorphism as long as p(p, q, r) is large 
enough relative to the sampling density. 

Theorem 3.2 (Topological Inference Theorem) Suppose that we have an e-sample 
U from X. Then for each pair of points p,q £ 1H N such that p = p(p, q, r) > 3e, 
the map X = X (p, q, r) is an isomorphism iff 

Dgm (ker 4> u ) (e, 2e) U Dgm (cok <f ) (e, 2e) = . 



Proof. 

To simplify exposition, we will refer to points in Dgm (ker x )uDgm (cok X ) 
and Dgm(ker0 (7 ) U Dgm(cok cj> u ) as X-points and (7-points, respectively. 

Whenever 0<a< / 3<3e<p, the two vertical maps in diagram |2j 
will by definition both be isomorphisms. Hence the maps ker<^> x — > ker0 x and 
cok</> x — > cok0 x must also be isomorphisms, and so, as a increases from 
to 00, any element of the (co)kernel of (j) X must live until at least 3e, and any 
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e 2e 3e 

Figure 8: The point in the X-diagrams lie either along the solid black line or in the darkly shaded 
region. Adding the lightly shaded regions, we get the region of possible points in the U -diagrams. 

(co)kernel class which is born after must in fact be born after 3e. In other words, 
any X-point must lie either to the right of the line x — 3e, or along the y-axis 
and above the point (0, 3e); see Figure [8] Recall that <^ x is an isomorphism iff 
ker</> x = = cok^ x . Thus </> x is an isomorphism iff the black line in Figure[8] 
contains no X-points. 

On the other hand, Theorem |3 . 1 1 require s that every [/-point must lie within e 
of an X-point. That is, all U -points are contained within the two lightly shaded 
regions drawn in Figure [8] Since the rightmost such region is more than e away 
from the thick black line, there will be a U -point in the left region iff there is an 
X-point on the thick black line. But the U -points within the left region are exactly 
themembersof Dgm(ker ^(e, 2e) U Dgm(cok?i u )(e, 2e). ED 



Examples. Here we give two examples illustrating the topological inference 
theorem. 

For the first example, suppose we have the space X in the left half of Figure 
[9] and we take the labelled points p and q and the radius r as drawn; in this case, 
one can show that p(p, q, r) — 8.5, which here is the distance between the line 
segment and the boundary of the intersection of the two r-balls. First we compute 
the (co)kernel persistence diagrams for <ji x , showing the kernel diagram in the right 
half of Figure|9] Since the y-axis of this diagram is free of any points (and the same 
holds for the un-drawn cokernel diagram), p and q have the same local structure at 
this radius level. 

On the other hand, suppose that we have an e-sample U of X, with e = 2.8 < 



p/3, as drawn in the left half of Figure 10 We can compute the analogous U 



diagrams, with the kernel diagram drawn in the right half of the same figure. Not- 
ing that the two rectangles defined by (e, 2e) in the two diagrams are indeed empty, 
and that the same holds for the cokernel diagrams, we can apply Theorem |3.2| to 
infer that the points have the same local structure at radius level r. 

For a second example, suppose X is the cross on the left half of Figure [11] with 
p, q, r as drawn. Then p and q are locally different at this radius level, as shown by 
the presence of two points on the y-axis of the kernel In the left half of Figure [T2| 
we show an e-sample U of X, with 3e < p(p, q, r). Note that the kernel diagram 
for cf> u does indeed have two points in the relevant rectangle. 
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8.5 



P \ I q 



8.5 12 birth 



Figure 9: Kernel persistence diagram of two local equivalent points, given X. 







• / 


f 

2 


8 5 


6 birth 



Figure 10: Kernel persistence diagram of two local equivalent points, given U . 




Figure 11: Kernel persistence diagram of two points that are not locally equivalent, given X. 
number, i.e., #2, labeling a point in the persistence diagram indicates its multiplicity. 
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Figure 12: Kernel persistence diagram of two points that are not locally equivalent, given U. 



3.3 Geometric Intuition 



Theorem 3.2 is stated in terms of a topological parameter, p — p(p, q, r). We can 
relate p to more geometrically-flavored quantities. Specifically, we will show that 
p is lower bounded by a parameter derived from local variants of reach, as well as 
from parameters related to the gradient of dx- Unfortunately, this lower bound can 
be quite loose, zero in certain cases, limiting its practical utility. It does provide a 
geometric intuition to the topological constraints on the point cloud. 

Recall that the medial axis M of an embedded space X is the subset of the 
ambient space consisting of all points which have at least two nearest neighbors 
on X, and that the reach r of X is defined by r = inf^ex dist (x,M). We fix 
notation for the following four intersections of A4 with different subsets of the 
ambient space: M(p,r) = MC\B r (p), M (j>, r) = MndB r (p), M(p, q,r) = 
M n B r (p) n B r (q), andMo(p,q,r) = M D d(B T (p) n B r (q)), and we define 
a variant of reach for each such space: 

r(p, r) = inf dist (x,A4(p, r)) 

To(p, r) = inf dvat{x,Ma(jp,r)) 

r(p, q, r) = inf dist (x,M(p, q, r)) 

r (p,q,r) = inf dist (x, Mo(p, q, r)). 

x£X 

Note that all four of these quantities are of course upper bounds on r itself. 

Letting Vx be shorthand for the gradient of dx, we define the following subset 
ofdBrip): 

G(p,r) = {y£ dB r (p) j V x (y) -L dB r (p)}, 

and then set r)(p, r) — inf^gx dist (a;, G{p, r)). We similarly define G(p, q, r) 
and rj(p,q,r), 



G( P , q, r) = {ye d(B r (p) PI B r (q)) \ V x (y) -L d(B r (p) nB r (q))}, 
n(p, 1, r) = inf dist (x, G{p, q, r)). 

x£X 

Given the above quantities the following lower bound holds. 
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Theorem 3.3 (Geometric lower bound) If we define 

7 = 7(P> <?, r ) = min{r(p, r), r(p, g, r), r?(p, r), ?j(p, g, r)}, 

fften p(p,q,r) > -y(p,q,r). 

The proof appears in Appendix [D] 

4 Probabilistic Inference Theorem 

The topological inference of Section [3] states conditions under which the point 
sample U can be used to infer stratification properties of the space X. The basic 
condition is that the Hausdorff distance between the two must be small. In this 
section we describe two probabilistic models for generating the point sample U, 
and we provide an estimate of how large this point sample should be to infer strat- 
ification properties of the space X with a quantified measure of confidence. More 
specifically, we provide a local estimate, based on p(p, q, r) and p(q,p, r), of how 
many sample points are needed to infer the local relationship at radius level r be- 
tween two fixed points p and q; this same theorem can be used to give a global 
estimate of the number of points needed for inference between any pair of points 
whose p- values are above some fixed low threshold. 

4.1 Sampling Strategies 

We assume X to be compact. Since the stratified space X can contain singularities 
and maximal strata of varying dimensions, some care is required in the sampling 
design. Consider for example a sheet of area one, punctured by a line of length 
one. In this case, sampling from a naively constructed uniform measure on this 
space would result in no points being drawn from the line. This same issue arose 
and was dealt with in |29], although in a slightly different approach than we will 
develop. 

The first sampling strategy is to remove the problems of singularities and vary- 
ing dimension by replacing X by a slightly thickened version X = Xa . We assume 
that X is embedded in R fe for some k. This new space is a smooth manifold with 
boundary and our point sample is a set of n points drawn identically and indepen- 
dently from the uniform measure /tt(X) on X, U = {xi, .x„} *~ /i(X). This 
model can be thought of as placing an appropriate measure on the highest dimen- 
sional strata to ensure that lower dimensional strata will be sampled from. We call 
this model Mi . 

The second sampling strategy is to deal with the problem of varying dimen- 
sions using a mixture model. In the example of the sheet and line, a uniform 
measure would be placed on the sheet, while another uniform measure would be 
placed on the line, and a mixture probability is placed on the two measures; for 
example, each measure could be drawn with probability 1/2. We now formalize 
this approach. Consider each (non-empty) i-dimensional stratum = Xi — Xi_i 
of X. All strata that are included in the closure of some higher-dimensional strata, 
in other words all non-maximal strata, are not considered in the model. A uniform 
measure is assigned to the closure of each maximal stratum, fn(Si), this is possible 
since each such closure is compact. We assume a finite number of maximal strata 
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K and assign to the closure of each such stratum a probability pi = 1 /K. This 
implies the following density 

1 K 

/(*) = ^X>( x=a: )> 

3=1 

where Vi is the density corresponding to measure pi . The point sample is generated 
from the following model: U = {xi, .x n } *~ f(x). We call this model M2. 

The first model replaces a stratified space with its thickened version, which 
enables us to place a uniform measure on the thickened space. Although this re- 
placement makes it convenient for sampling, it does not sample directly from the 
actual space. The second model samples from the actual space, however the sam- 
ple is not uniform on X with respect to Lebesgue measure. 



4.2 Lower bounds on the sample size of the point cloud 



Our first main theorem is the probabilistic analogue of Theorem |3.2| An immediate 
consequence of this theorem is that, for two points p,q £ U, we can infer with 
probability at least 1 — £ whether p and q are locally equivalent, p ~ r q. The 
confidence level 1 — £ will be a monotonic function of the size of the point sample. 

The theorem involves a parameter v(p), for each positive p, which is based on 
the volume of the intersection of p-balls with X. First we note that each maximal 
stratum of X comes with its own notion of volume: in the plane punctured by a 
line example, we measure volume in the plane and in the line as area and length, 
respectively. The volume vol (Y) of any subspace Y of X is the sum of the volumes 
of the intersections of Y with each maximal stratum. For p > 0, we define 

vol(B p/24 (3 ; )nX) 
x&s. vol (X) 

We can then state: 

Theorem 4.1 (Local Probabilistic Sampling Theorem) LetU — {xi, X2, x n } 
be drawn from either model Mi or M2, Fix a pair of points p, q £ M. N and a pos- 
itive radius r, and put p = min{p(p, q, r), p(q,p, r)}. If 

^ 1 A 1 , 1 



v (p) V v (p) C 

then, with probability greater than 1 — £ we can correctly infer whether or not 
x (p, q, r) and (j> x (q,p, r) are both isomorphisms. 

Proof. 

A finite collection U = {xi,X2, x n } of points in l" is e-dense with re- 
spect to X if X C U e ; equivalently, U is an e-cover of X. Let C(e) be the e- 
covering number of X, the minimum number of sets B E flX that cover X. Let 
P(e) be the e-packing number of X, the maximum number of sets B E nX that can 
be packed into X without overlap. 

We consider a cover of X with balls of radius p/12. If there is a sample point in 
each p/12-ball, then U will be an e-approximation of X, with e < 4(p/12) = p/3. 
This satisfies the condition of the topological inference theorem, and therefore we 
can infer the local structure between p and q. 

The following two results from 1 28] will be useful in computing the number of 
sample points n needed to obtain, with confidence, such an e-approximation. 
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Lemma 4.1 (Lemma 5.1 in 1 28 1) Let {Ai, A2, ...,Ai} be a finite collection of 
measurable sets with probability measure /i on U l i=1 Ai, such that for all Ai, 
H(Ai) > a. Let U — {xi, X2, x n } be drawn iid according to fi. If n > 
i (log I + log |), then, with probability 1 — f , Vi, U H A t ^ 0. 

Lemma 4.2 (Lemma 5.2 in |28|) Let C{e) be the covering number of an e-cover 
o/X and P{e) be the packing number of an e-packing, then 

P(2e) < C(2e) < P(s). 

Again, we consider a cover of X by balls of radius p/12. Let G X be 

the centers of the balls contained in a minimal sub-cover. Put A; = -Bp/i^J/i) 1 "^- 
Applying Lemma |4~T| we obtain the estimate 

n> — ( logZ + log ~ 



where I is the /9/12-covering number, and a = min; 
Applying Lemma [42] 



vol (Aj) 
vol (X) ' 



; " W ' - vol (B p/24 n X) - u(p) 
On the other hand, - < -4^ by definition, and the result follows. EH 

To extend the above theorem to a more global result, one can pick a positive 
p and radius r, and consider the set of all pairs of points (p, q) such that p < 



min{p(p, q, r) , p(q, p, r}. Applying Theorem 4. 1 uniformly to all pairs of points 
will give the minimum number of sample points needed to settle the isomorphism 
question for all of the intersection maps between all pairs. 



5 Algorithm 

The theorems in the last sections give conditions under which a point cloud U, 
sampled from a stratified space X, can be used to infer the local equivalences 
between points on X and its surrounding ambient space. We now switch gears 
slightly, and imagine clustering the ^/-points themselves into strata. The basic 
strategy is to build a graph on the point set, with edges coming from positive iso- 
morphism judgements. The connected components of this graph will then be our 
proposed strata. We begin by describing this strategy in Section [5T| Some of its 
potential limitations are discussed in Section [5^2] where we also describe a more 
robust variant based on graph diffusion. 

A crucial subroutine in the clustering algorithm is the computation of the dia- 
grams Dgm(ker (f> u ) and Dgm(cok 4> u ), for (f> u = 4> u (p, q, r) between all pairs 
(p, q) £ U x U. The algorithm for this sub-routine is quite complicated, we de- 
scribe it in detail in Section [53] The correctness proof is even more complicated; 
we give a proof sketch in Section [J!4] deferring all major details to Appendix [C] 

We would like to make clear that we consider the algorithm in this paper a first 
step and several issues both statistical and computational can be improved upon. 
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5.1 Clustering 



We imagine that we are given the following input: a point cloud U sampled from 
some stratified space X, and a fixed radius r. We make the assumption that 
dn{U, X) < e < where p m i n is the minimum of p(p,q,r) for all pairs 

(p, q) G U x U. Later we discuss the consequences when this assumption does 
not hold and a possible solution. 

We build a graph where each node in the graph corresponds uniquely to a 
point from U. Two points p,q £ U (where \\p — q\\ < 2r) are connected by 
an edge iff both <^> x (p, q, r) and (j) X (q,p,r) are isomorphisms, equivalently iff 
Dgm(ker c/ )(e, 2e) and Dgm(cok<^ c/ )(e, 2e) are empty. The connected com- 
ponents of the resulting graph are our clusters. A more detailed statement of this 
procedure is giving in pseudo-code, see Algorithm[TJ Note that the connectivity of 
the graph is encoded by a weight matrix, and our clustering strategy is based on a 
0/1-weight assignment. 



Algorithm 1 Strata-Inference([7, r, e) 
for all p, q <E U do 
if ||f>-<z|| > 2rthen 

W(p,q) = 
else 

Compute Dgm(ker 4> u (p, q, r)) and Dgm(cok <fi u (p, q, r)) 
Compute Dgm(ker 4> u (q,p, r)) and Dgm(cok (f> u (q,p, r)) 
if Dgm(ker u (p, q, r))(e, 2e) U Dgm(cok(/) C/ (p, q, r))(e, 2e) ^ then 
W(p,q)=0 

else if Dgm(ker0 c/ ((j,p,r))(e,2e) UT)gm(cok<j) U (q,p,r))(e,2e) ^ then 

W(p,q)=0 
else 

W(p,q) = l 

end if 
end if 
end for 

Compute connected components based on W. 



5.2 Robustness of clustering 

Two types of errors in the clustering can occur: false positives where the algorithm 
connects points that should not be connected and false negatives where points that 
should be connected are not. The current algorithm we state is somewhat brittle 
with respect to both false positives as well as false negatives. We will suggest a 
very simple adaptation of our current algorithm that should be more stable with 
respect to both false positives and false negatives. 

The false positives are driven by the condition in Theorem |3.2| that p m i n < 3e, 
so if the point cloud is not sampled fine enough we can get incorrect positive iso- 
morphisms and therefore incorrect edges in the graph. If we use transitive closure 
to define the connected components this can be very damaging in practice since a 
false edge can collapse disjoint components into one large cluster. 
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The false negatives occur because our point sample U is not fine enough to 
capture chains of points that connect pairs in U through isomorphisms, there may 
be other points in X which if we had sampled then the chain would be observed. 
The probability of these events in theory decays exponentially as the sample size 
increases and the confidence parameter £ in Theorem |3.2| controls these errors. 

We now state a simple adaptation of the algorithm that will make it more robust. 
It is natural to think of the 0/1-weight assignment on pairs of points p, q £ U as 
an association matrix W. A classic approach for robust partitioning is via spectral 
graph theory 1 26] [24] [9). This approach is based an eigen-decomposition of the 
the graph Laplacian, L = D — W with the diagonal matrix Du = Wij . The 
smallest nontrivial eigenvalue Ai of W is called the Fiedler constant and estimates 
of how well the vertex set can be partitioned 1 17 1. The corresponding eigenvector 
Vi is used to partition the vertex set. There are strong connections between spectral 
clustering and diffusions or random walks on graphs [£)• 

The problems of spectral clustering and lower dimensional embeddings have 
been examined from a manifold learning perspective IT1l2l ll8l . The idea central 
to these analyses is given a point sample from a manifold construct an appropriate 
graph Laplacian and use its eigenvectors to embed the point cloud in a lower di- 
mensional space. A theoretical analysis of this idea involves proving convergence 
of the graph Laplacian to the Laplace-Beltrami operator on the manifold and the 
convergence of the eigenvectors of the graph Laplacian to the eigenvalues of the 
Laplace-Beltrami operator. A key quantity in this analysis is the Cheeger constant 
which is the first nontrivial eigenvalue of the Laplace-Beltrami operator [8J. An 
intriguing question is whether the association matrix we construct from the point 
cloud can be related to the Laplacian on high forms. 

5.3 Diagram Computation 

We now describe the computation of the diagrams Dgm(ker cj> U ) and Dgm(cok (j) U ). 
To do this, we need for each a > a simplicial analogue of the map 

: H(B?{a),dB%{a)) -> H(B£(a), dB"(a)). 

To produce this, we first define, for each a > 0, two pairs of simplicial complexes 
Lo(a) C L(a) and Ko(a) C K(a), and z relative homology map 

tp a : H(L(a),L (a)) ->■ H(K(a),K (a) 

between them. We will then show that 

Dgm(ker0 c/ ) = Dgm(ker-i/') and Dgm(cok<^ c/ ) = Dgm(cok-i/>). 

To compute the diagrams involving tp, we reduce various boundary matrices; since 
we follow very closely the (co)kernel persistence algorithm described in 1121 . we 
omit any further details here. 

5.3.1 Complexes 

To construct the simplicial complexes in our algorithm, we take the nerves of sev- 
eral collections of sets which are derived from a variety of Voronoi diagrams of 
different spaces. Here we briefly review these concepts. 
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Nerves. The nerve N(C) of a finite collection of sets C is defined to be the 
abstract simplicial complex with vertices corresponding to the sets in C and with 
simplices corresponding to all non-empty intersections among these sets; that is, 
N(C) = {S C C | Pi S 7^ 0}. Every abstract simplicial complex can be geomet- 
rically realized, and therefore the concept of homotopy type makes sense. Under 
certain conditions, for example whenever the sets in C are all closed and convex 
subsets of Euclidean space ([15], p. 59), the nerve of C has the same homotopy 
type, and thus the same homology groups, as the union of sets in C. 

Voronoi diagram. If U is a finite collection of points in R k and m G U, then 
the Voronoi cell of «; is defined to be: 

Vi = V(ui) = {i£l' | \\x - m\\ < \\x - MjIIjVmj G U}. 

The set of cells Vi covers the entire space and forms the Voronoi diagram of R fc , 
denoted as Voi((7|R fc ). If we restrict each Vi restricted to some subset X C 
K fe , then the set of cells Vi D X forms a restricted Voronoi diagram, denoted as 
Voi (U\X). For a simplex ct G 17, we set V CT = f] Uiecr Vi. 

The nerve of the restricted Voronoi diagram Voi (U\X) is called the restricted 
Delaunay triangulation, denoted as Del (U\X). It contains the set of simiplices a 
for which V a n X / 0. 

Power cells. An important task in our algorithm is the computation of the rel- 
ative homology groups H(B%(a), dBp 1 (a)) and H(B^ q (a), dB% q (a)). Now to 
compute H(U a ), the absolute homology of the thickened point cloud, we would 
need only to compute the nerve of the collection of sets Vi PI U a . This is because 
each such set is convex and their union obviously equals the space U a . Such a 
direct construction will not work in our context, for the simple reason that the sets 
Vi n dBp (a) and Vi n dBp q (a) need not be convex. 

To get around this problem, we first define P(a), the power cell with respect 
to B r (p), to be: 

P(q) = |x -p|| 2 - r 2 < \\x- u\\ 2 - a 2 ,Vu G U}, (4) 

and we set Po{a) = B r (p) — intP(a). To define Q(a), the power cell with 
respect to B r (q), we replace p with q in (Q. Finally, we setZ(a) = P(a)nQ(a), 
andZo(a) = (B r (p)r\B r (q))— int Z(a). This is illustrated in Figure[T3| We note 
that Po(a) and Zo(a) are both contained in U a , as can be seen by manipulating 
the inequalities in their definitions. 

Replacing dBp (a) with Po(a) and dBp q (a) with Zo(a) has no effect on the 
relative homology groups in question, as is implied by the following two lemmas. 
The first lemma was proven in 0; a proof of the second appears in Appendix [C] 

Lemma 5.1 (Power Cell Lemma) Assume B r (p) — Po(a) ^ 0. The identity on 
Bp (a) is a homotopy equivalence of (B p (a) , d Bp (a)) and (Bp (a), Po(a)), 
as a map of pairs. 

Lemma 5.2 (Intersection Power Cell Lemma) Assume B r (p)nB r (q)—Zo(a) ^ 
0. The identity on Bp q (a) is a homotopy equivalence of (Bp q (a) , dBp q (a)) and 
(Bp q (a), Zo(a)), as a map of pairs. 
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Figure 13: Illustration of intersection power cell Z(a), as the shaded region. The unshaded 
convex regions are P(a) and Q(a) respectively. 




Figure 14: Illustration of the lune and the moon. The shaded regions are the respective moons. 
The white regions within solid circles are the respective lunes. 

Lune and moon. It can be shown (|3|) that the sets Vt n Po(a) are convex. 
Unfortunately, it is still possible for some set V; D Zo (a) to be non-convex. To fix 
this, we must further divide the Voronoi cells in a manner we now describe. 

We consider the hyperplane P of points in R fe which are equidistant to p and q; 
we often refer to this hyperplane as the bisector. This will divide R fc into two half- 
spaces; let P p and P q denote the half-spaces containing p and q, respectively.. We 
also define the p-lune, L p — P q n B r (p), and the p-moon, M p = P v D B r (p), as 
illustrated in Figure [14] The lune and the moon divide each Voronoi cell into two 
parts, Vt = Vifl Lj, and V^ M = Vi PI M p . These sets are convex, assuming they are 
non-empty, since they are each the intersection of two convex sets. Furthermore, 
we have the following lemma whose simple but technical proof we omit: 

Lemma 5.3 (Convexity Lemma) The sets Vt PI Zo(a) and V t M n Zo(a) are all 
convex, assuming they are non-empty. 

Of course the nonempty sets among H Po{a) and D Poia) will also be 
convex. 

To construct the simplicial complexes needed in our algorithm, we define A 
to be the collection of the nonempty sets among Vt PI Bp (a) and V^ M n Bp (a), 
and we define Ao to be the collection of the nonempty sets among Vt n Po (ct) 
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Figure 15: Illustration of the simplicial complexes constructed around two points p and q. 
underlying Voronoi decomposition of the space is shown in thin dotted lines, wi and 112 in U 
the points whose restricted Voronoi regions intersect with the lune at non-convex regions. 

and V™ n Pb(a). Note that UA = Bp (a) and UA = Po(a). Taking the 
nerve of both collections, we define the simplicial complexes L(a) = N(A) and 
L (a) = N(A ). 

Similarly, we define C and Co to be the collections of the nonempty sets among, 
respectively, Vl n Bp, (a) and V™ n Bp, (a), and Vt n Z (a) and V™ n Z (a). 
And we define K(a) = N{C) and K (a) = AT (Co). 

An example of the simplicial complexes constructed in R 2 for a given U are 
illustrated in Figure[l5] A direct approach to construct these simplicial complexes 
runs into difficulties as the corners of the convex sets created by the bisector can 
be shared by many sets; we defer the technicalities to Appendix [B] 

5.3.2 Maps 

We now construct simplicial analogues 

^ a : H(i(a),Lo(o)) -> H(K(a),K {a). 

of the maps 0„ , 

The containments Lo(a) C L(a) and ifo(a) C K(a) are obvious. In order 
to define we first need the following technical lemma: 

Lemma 5.4 (Containment Lemma) Assume that a simplex a is in Lo (a). If a is 
also in K(a), then a is in Ko(a), as well. 

PROOF. . By definition, a G Lo(a) iff there exists some point x G V CT n Po{a). 
We must show that the set V" PI Zo (a) is non-empty. Note that x £ Pq (a) implies 
that x 6 B r (p), while x £ int P(a) implies that x £ int Z(a). If x G B r (q), 
then we are done, since Zo(a) — B r (p) n B r (q) — int Z(a). 

Otherwise, choose some point y G V T nL r a nB r (p)nB, (g), which is possible 
since <r G if (a). Since both x and y belong to the same convex set V n U a PI 
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B r (p), there exists a directed line segment 7 from x to y within this set connecting 
them. We imagine moving along 7 and first we suppose that 7 intersects B r (q) 
before it intersects intQ(a). Let z be the first point of intersection. Then 2 G 
B r (p) n B r (q), z g int Q{a). Therefore z £V a C\ Z (a), On the other hand, we 
may prove by contradiction that it is impossible for 7 to intersect Q(a) before it 
intersects B r (q). Let z' be the first point of such an intersection. Since z G Q(a), 
by definition || z' —q\ | 2 — r 2 < \z' — Ui | | 2 — a 2 , Vu^ G U. Since z' G [7 a ,Vu; G a, 
\\z' - Ul \\ 2 -a 2 < 0. Therefore | \z'-q\ | 2 - r 2 < \z -Ui\ | 2 - a 2 < 0, Vu; G cr. 
Since z' is outside B r (q), \ \z' — q\\ 2 — r 2 > 0. This is a contradiction. E3 

To define ij) a , we first construct a chain map g — g a : C(L(a)) — > C(K(a)) 
as follows. Given a simplex a G L(a), we define g(a) = cr if a G A'(a), 
and c/(cr) = otherwise; we then extend g to a chain map by linearity. Using 
the Containment Lemma, we see that g(C(Lo(a))) C C(A"o(a)), and thus g 
descends to a relative chain map / = f a : C(L(q), Lo(a)) — > C(K(a), Ko(a)). 
Since / clearly commutes with all boundary operators, it induces a map on relative 
homology, and this is our ip = ip a . 



5.4 Correctness 

We show that our algorithm is correct by proving the following theorem. A sketch 
of the proof is given here, with the details deferred to Appendix [C] 

Theorem 5.1 (Correctness Theorem) The persistence diagrams involving sim- 
plicial complexes are equal to the persistence diagrams involving the point cloud, 
that is, Dgm(kerc6 c/ ) = Dgm(ker7/;) and Dgm(cok 4> v ) — Dgm(cok^). 

Proof sketch. To prove Theorem |5.l[ we will prove, for each a < /3, that the 
following diagram (as well as a similar diagram involving cokernels) commutes, 
with the vertical maps being isomorphisms. 

. . . — >ker (f)^ — > ker cf>p — y . . . 

. . . — >ker^> a — > keri/>a (5) 



Applying Theorem 2.2 then finishes the claim. Dgm(kerc6 ) = Dgm(ker-!/>) 
and Dgm(cok <j> u ) — Dgm(coki/>). 



6 Simulations 

We use a simulation on simple synthetic data with points sampled from grids to 
illustrate how the algorithm performs. In these simulations we assume we know e, 
and we run our algorithm for < a < 2e. The data sets are shown in Figure [T6] 

We use the following results to demonstrate that the inference on local struc- 
ture, at least for these very simple examples, is correct. As shown in Figure [T7| 
top, if two points are locally equivalent, their corresponding ker/cok persistence 
diagrams contain the empty quadrant prescribed by our theorems, while in Figure 
[T7] bottom, the diagrams associated to two non-equivalent points do not contain 
such empty quadrants. Similar results are shown for other data sets in Figure [T8| 
and Figure[T9l 
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Figure 16: From left to right: points sampled from a cross; points sampled from a plane inter- 
secting a line; points sampled from two intersecting planes. All points are located on the grid. 




Figure 17: Top: both points are from 1-strata. Bottom: one point from 0-strata, one point from 
1-strata. Left part shows the locations of the points. Right part shows the ker/cok persistence 
diagram of two points respectively, if the diagrams are the same, only one is shown. A number 
labeling a point in the persistence diagram indicates its multiplicity. 
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#2 




Figure 18: Top: one point from 0-strata, one point from 2-strata. Middle: one point from 0- 
strata, one from 1-strata. Bottom: both points are from 2-strata. A number labeling a point in 
the persistence diagram indicates its multiplicity. 
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Figure 19: Top: both points from 1-strata. Middle: one point from 1-strata, one from 2-strata. 
Bottom: both points are from 2-strata. A number labeling a point in the persistence diagram 
indicates its multiplicity. 
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7 Discussion 



As the title of the paper suggests we have presented a first step towards learning 
stratified spaces. In the discussion we state some future problems and extensions 
of interest. 

Algorithmic efficiency: The algorithm to compute the (co)kernel diagrams from 
the thickened point cloud is based on an adaption of Delaunay triangulation and 
the power-cell construction. This algorithm should be quite slow when the dimen- 
sionality of the ambient space is high due to the runtime complexity of Delaunay 
triangulation. One idea to address this bottleneck is to use Rips or Witness com- 
plexes 1 13 1; at the moment, we are not sure how to approach a proof of correctness, 
due to problems presented by the boundaries of the r-balls. Another approach is to 
use dimension reduction techniques such as principal components analysis (PCA) 
or random projection that approximately preserve distance 1 10 1 as a preprocessing 
step. Another idea that may work if the ambient dimension is not too high is using 
faster algorithms to construct Delaunay triangulations |4|. 

Weighting local equivalence: Currently we use a graph with 0/1 weights based 
on the local equivalence between two points. Extending this idea to assign frac- 
tional weights between points is appealing as it suggests a more continuous metric 
for local equivalence. This may also allow for greater robustness when using spec- 
tral methods to assign points to strata. 

Curvature moderated tubes: Markus J. Pfiaum 1 30 1 introduced a concept called 
curvature moderation that regulates the behavior of the tangent spaces of a stratum 
near the boundary. In other words, a stratum is curvature moderate if it curves near 
the boundary in a controlled way, this includes the higher derivatives of the curva- 
ture. This is yet another way to describe how strata and their tubular neighborhood 
are "glued together nicely". It would be interesting to connect this concept to our 
idea of "local reach". 

Noisy data: Our sampling models draw points from the underlying topological 
space. A more general model would sample points that are concentrated on the 
topological space. A version of this type of sampling model is discussed in |28|. 
It would be of interest to study how well our approach is suited to such a model. 

Adaptive sampling conditions: Throughout this paper we use e-approximation 
to characterize the similarity of the point sample to the topological space. There 
are other approximation criteria that may be interesting to study and may provide 
better sampling estimates. One such criterium is the e-sample 1141 which is adap- 
tive in that it is proportional to the local feature size. Another criterium of possible 
interest is the weak feature size |7 1. 
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Appendices 



A Defining the Map 

We give a more precise description of the map 

* = H{B^{a),8B^{a)) -> H(B£(a),0B&(a)). 

The definition will be made on the chain level and will be given in terms of singular 
chains. 

A.l Background 

We give here some necessary background as well as some material from algebraic 
topology and homological algebra which will be needed in Appendix [C] Most of 
the descriptions are adapted from [22 1 and [27]. 

Chain homotopies. For our purposes, a chain complex C is a sequence of 
Z/2Z- vector spaces C v , one for each integer p, connected by boundary homo- 
morphisms dp : C v — > C p _i such that <9 p _i o d p = for all p. The p-th 
homology group of such a chain complex is defined by H p = ker <9 p /im d p +i. 

A chain map 77 : C — > T> between two chain complexes is a sequence of homo- 
morphisms r\ v : C p — > D p which commute with the boundary homomorphisms: 
dp o ri P = rj p ^i o dp . Every chain map induces a map 77* between the homology 
groups of the two complexes. 

A chain homotopy F between two chain maps r/, 77' : C — > T> is a sequence of 
homomorphisms F p : C p — > D P +i which satisfy the following formula for each 
p: T? — ??' = d p+ ioFp-Fp-10 dp . We say that 77 and 77' are chain homotopic and 
note that they must then induce the same maps on homology: 77, = Finally, 77 
is called a chain homotopy equivalence if there exists a chain map p :T> — > C such 
that 77 o p and p o 77 are both chain homotopic to the identify. In this case 77 and p 
will both induce homology isomorphisms. 

Singular homology. The standard p-simplex is the subset of W +1 given by 

v 

A p = {(t ,...,t p ) £ R p+1 |^ti = l,Vi,ti > 0}. 

! = 

The p + 1 vertices of A p are points {ei} C R p+1 where 

e = (1,0,0,...,0), 
ei = (0,1,0,...,0), 

e p = (0,0,0,...,1). 

A singular p-simplex of a topological space X is a continuous map 8 : A p — > 
X. By taking formal sums of singular simplices (using binary coefficients for our 
purposes) one forms C P (X), the singular chain group of X in dimension p. Given 
points ao, ap in some Euclidean space, which need not be independent, there is 
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a unique affine map I of A p that maps the vertices ej of A p to 04. This map defines 
the linear singular simplex determined by ao, a p , denoted as l(ao, ...,a p ). One 
then defines a boundary homomorphism d p : A P (X) — > A p -i(X) by: 

dp(8) = Ef =Q ((5oZ(eo, ...,£«,..., e P )), 

and defines the singular homology groups H P (X) as above. A continuous map 
/ from X to another topological space Y induces a chain map /# : C P (X) — > 
C p (y) given by the formula = / o S, and thus also a homology map 

/. :H,:.V:. >H,;V:. 

The minimal carrier of a singular simplex 8 is its image S(A P ), and the min- 
imal carrier of a singular p-chain ^ Si is the union of the minimal carriers of the 
St. 

Isomorphism between simplicial and singular homology. The (simpli- 
cial) homology groups of a simplicial complex K and the singular homology 
groups of its realization \K\ are isomorphic. To show an explicit isomorphism 
(| 27 1), we first define a chain map 

r, : C(K) ^ C(\K\) 

as follows 1 27 1 : choose a partial ordering of the vertices of K that induces a linear 
ordering on the vertices of each simplex of K. Orient the simplices of K by using 
this ordering, and define 

T]([vo, -,v p ]) = l(v , ...,v p ), 

where «o < ... < v p in the given ordering. We refer to the linear singular sim- 
plex l(vo, ...jVp) as a simplicial linear singular simplex and it is important in the 
subsequent sections. The chain map r\ is in fact a chain equivalence as it has a 
chain-homotopy inverse, for which a specific formula can be found in 1 16 1. Hence 
the induced homology map 77, provides an isomorphism of simiplicial with singu- 
lar homology. 

A.2 Intersection Map Details 

We now give the full and formal definition of the homology map (fi — (fi^ , starting 
on the chain level. For compactness, we will use the following shorthand: 

I = S>) = !7 a nB r (p), 
B = dB^(a) = U a n dB r {p), 
S = B^ q (a) = u a n B r (p) n B r (q), 
A = U a nB r (p) -int(S), 

u = u a n B r (p) - s. 

Note that X ~ U = S — B pq (a) and A — U = dS = dB^ q (a). So to define 
ij> we need only define a chain map j : C(X, B) — > C(X — U, A — U) and then 
take <j) as the map induced on homology. The map j is defined as the composition 
j = k o i. The chain map i : C(X, B) — > C(X, A) is induced by inclusion on 
the second factor, while the chain map k : C'(X, A) C'(X — U, A — U) is an 
excision, although this latter statement requires further elaboration. 
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Excisions. The inclusion map of pairs (X — U, A — U) — > (X, A) is called an 
excision if it induces a homology isomorphism; in this case one says that U can be 
excised. We will make use of the following two results about excision (see, e.g., 
ED): 

Theorem A.l (Excision Theorem) If the closure ofU is contained in the interior 
of A, then U can be excised. 

Theorem A.2 (Excision Extension) Suppose V C U C A and 

( i) V can be excised. 

( ii) (X — U, A — U) is a deformation retract of (X — V, A — V). 
Then U can be excised. 

In our context, the closure of U need not be contained in the interior of A, and so 
we must define a suitable V C U. Although there are many ways to do this, one 
direct way is to choose some small enough positive 8. 

I = U a n d(B r (p) n B r (q)) n cl (U), 
Is = {xe cl(U)\di(x) < S}, 
V = U-h, 

where di(x) is the distance from x to the set I. It is straightforward to verify 
that V C U C A satisfies the hypotheses of Theorem |A.2| In other words, the 
inclusion of pairs (X — V, A — V) — > (X, A) is an excision; its induced chain 
map has a chain-homotopy inverse, which we denote as s : C(X, A) — > C(X — 
V, A — V). Finally, we define k = r# o s, where r# is the chain map induced by 
the retraction r : (X - V, A - V) ->• (X - U, A - U). 

Subdivision. In order to fully carry out the analysis in Appendix B, we must 
first decompose the maps i and k through subdivision. Given a topological space 
X and a collection A of subsets of X whose interiors form an open cover of X, 
a singular simplex of X is said to be A-small if its image set is entirely contained 
in a single element of A. For each dimension p, the chain group C*(X) is the 
subgroup of C P (X) spanned by the ,4-small singular p-simplices. These groups 
form a chain complex, with homology H A (X). Of course, any singular simplex on 
X can be subdivided into a sum of ,4-small simplices, so it is plausible, and in fact 
true (| 22 1), that the inclusion C A (X) -> C{X) is a chain homotopy equivalence. 

Returning to our context, we set A = {X — V, A} and denote by I the chain 
inclusion C A (X, A) -> C(X,A). We also let p : C(X,B) -> C A (X,B) be 
the chain homotopy inverse of the chain inclusion C A (X, B) — > C(X, B), and 
let t : C A {X,B) -> C A (X, A) be the chain map induced by inclusion on the 
second factor. Finally we note that i = I o t o p. 

We also decompose k as k = r^or/op, where r\ is the chain homotopy inverse 
of the chain map C(X — V,A — V)-¥ C(X, A) -> C A (X, A). 

Summary. To summarize, our map (j> — j», where j is the chain map defined 
by the following sequence of chain maps 

j — k o i = (r# o r] o p) o (I o t o p). 
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Following the same framework as above, we also define a chain map j' and its in- 
duced homology map 4>'=f* : H(BV{a),P (a)) -> H(B£(a), Z {a)) simply 
by adopting the notation: 



X = 


B%{a) = U a r\B r {p), 


B' = 


Ma), 


S = 


B% q {a) = U a D B r (p) n B r (q), 


A' = 


U a nB r (p)-S + Z (a), 


U = 


u a nB r ( P )-s, 


I = 


U a n d(B r ( P ) n B r { q )) n cl (U), 


Is = 


{x e c\(U)\di(x) < S}, 


V = 


U-Is, 



defining our open cover to be A' = {X—V, A'}, and otherwise proceeding exactly 
as before. 

Similarly, we create a chain map /' which induces tp' — f' t : H(|SdL|, |SdLo|) — > 
H ( | Sd K | , | Sd K \ ) , using the notation 



II 

X 


= Sd il, 


b" 


= |SdLo|, 


a" 


= (|SdL| - \SdK\) U \SdK 


u" 


= \SdL\ -int|SdK|, 


I 


= \SdK\ncl(U'), 


Is 


= {xe d(U')\di{x) < 5}, 


il 

V 


= U'-I e , 



with .4" = {A",X" — V"}. 

B Algorithm Details 

We give the details in constructing the simplicial complexes described in our al- 
gorithm. The various simplicial complexes, L, Lq, K and K , are the nerves of 
collections of convex sets. Here we go through the construction of L; construction 
of the others is similar. 

Implicit Perturbations. A direct approach to constructing L, the nerve of the 
collection B, runs into difficulties as the corners of the convex sets created by the 
bisector P can be shared by many sets. To cope with this difficulty, we perturb 
these convex sets ever so slightly so that they meet in general position. Note that 
this is not done by perturbing the bisector; rather, it is done by decomposing the 
bisector into pieces. 

We are interested in the restricted Voronoi diagram of the sublevel sets inside 
the ball B r (p), which we denote as V = Voi (U\U a n B r (p)). The restricted 
Voronoi cell of Ui is defined as V(ui\U a n B r (p)) = V(ui) n B r (p). 
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Given V, we create three sets of points. Let T vq be the set of points V4 £ 
U whose restricted Voronoi cells have non-trivial intersection with the bisector: 
V(ui\U a n B r (p)) n P 7^ 0. We impose an ordering of points in T pq , w.l.o.g., let 
the ordered set be T pq = {xi, X2, T P is the set of points m £ U which 

are not in T pq and are closer to p than they are to q. Similarly, T q is the set of 
points Ui £ U, which are not in T pq and are closer to q. 

By construction, the bisector P intersects the restricted Voronoi cells of the 
points in T pq . We denote these corresponding intersections as {Pi, P2, P m }- 
We perturb each Pi slightly such that no two pieces are collinear. Note that Pi is 
perpendicular to the direction q — p. One possible choice of perturbation would 
move each Pi within the restricted Voronoi cell along the direction q — p for ie 
distance, where e is sufficiently small. An example in R 2 is shown in Figure 
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Given such a perturbation, we let A be the resulting collection of perturbed 
convex sets, and we compute L = Nerve (A) instead of L — Nerve (A). By the 
properties of nerve construction, Nerve (A) — U A, Nerve (.4) ~ U A. Since 
|J A — |J A, then we have L ~ L. We now describe how we construct L. 



Case Analysis. Let L' be the restricted Delaunay triangulation, L' = Del(U\U a n B r (p)). 
We read the simplicies from A without explicit perturbations. Specifically, we fol- 
low a set of rules, described below, to construct L from L'. 

The bisector divides the restricted Voronoi cell of a point x £ T pq into two 
convex sets. Let x p represent the perturbed convex set closer to p in the nerve 
construction, and let x q represent the other set. Let a be a simplex in L' with k 
vertices, that is, a — [j/i, J/2, Vk\- There are seven cases regarding the member- 
ship of the points {yi , y% , ... , yu}. 

1. All yi £ a belong to T p . We add the simplex [yi, j/2, yt] to L. 

2. All J/,; £ a belong to T q . Same as case 1. We add the simplex [yi, 1/2, J/fc] 
to L. 

3. All j/i £ a belong to T pq . Suppose {3/1, j/2, yk} are sorted according to 
the ordering in T pq . We add the following simplicies and their faces to L: 

[y%, -,y v m ,y\,yl\ 

r V V Q Q Q1 

\y v m ,y\,yl, -,2/m] 



4. Some j/i are in T p , the rest are in T pq . Suppose {j/^ , yi n } C T p and 
{yji , -j yii } C T pq . We add [y n ,...,y in , y p H , y v n ] to L. 

5 . Some j/,: are in T q , the rest are m.T pq . Similar to case 4, suppose { j/i 1 , . . . , j/i n } C 
T q and {y h , ...,y k } C T pq . We add [y h ,...,y in , y q H ,-,?/?] to L. 

6. Some j/i are in T p , the rest are in T q . We show that Case 6 is impossible. 
Choose yi & T p and yj £ T q such that j/i and yj are connected by an edge. 
Since j/i and j/j are on the opposite sides of P, this edge must intersect P at 
some point z. Then their corresponding restricted Voronoi cells, V (yi | U a n 
B r (p)) and V(yj\U a PiB r (p)), must meet at a Voronoi face, which contains 
the point z. Suppose that the Voronoi face is in general position, that is, it is 
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Figure 20: An example of the implicit perturbation. Dotted lines are the bisectors. A simplex 
[yi, 2/2, 2/3] G £' is shown in the left. The simplices in L and in L are shown in the middle and 
right, respectively. 



not parallel to P. Then P intersects the Voronoi face, by definition, yi and 
Hj must belong to T pq . This is a contradiction. 

Some yi are in T v , some are in T q , and the rest are in T vq . We show that case 
7 is impossible using the same proof in case 6. 



A simple example is shown in Figure 20 Given [2/1,2/2,2/3] £ L' , simplex 
[2/1,2/2,2/3] ' s added to L according to case 4. Given [272,2/3] £ £, simplices 
[2/f, 2/3 , 2/f]> fe/3 , 2/2 > 2/f] an d their faces are added to L according to case 3. 

In summary, we construct L by iterating through all simplices a in L', adding 
new simplicies to L constructed from a following the above cases. 



C Algorithmic Correctness 

We prove the correctness of the algorithm described in Section [53] by proving 
Theorem |5.1 1 More precisely, we will prove that diagram [5] commutes, with the 
vertical arrows being isomorphisms, for some arbitrary but fixed choice of a < f3; 
we will omit the very similar argument about cokernels. The proof is unfortunately 
lengthy, and at times a bit technical, for in order to prove our statements about 
diagram[5] we must also prove similar statements about several other interlocking 
diagrams. For sanity and clarity of presentation, we first exhibit all the diagrams 
at once in the form of the following double-cube (Figure |2"T), 

C.l Bottom Face 

The bottom face of the double-cube has been detached and drawn in diagram [6] 
The horizontal maps in the upper square are induced by inclusion of pairs, and so 
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H(K(a),K (a)) 



inclusion • 



H(BV(a),dBV(a)) 



H(L(a),L (a)) -"J 
H{BV(a),P (a)) 



H(B%(a),dB%(a)) ± 




H(K(J3),K (J3)) 



H(BM,z (p)) 



H(B£(/J),0B£(/?)) 
H(L((3),L ((3)) 



H(B%(J3),P (J3)) 



H(B"U3),dB"(0)) 



Figure 21: Two adjacent commuting cubes. 
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the upper square certainly commutes. 



H(B"(a),dBV(a)) ^> H(£#(/J), dB*{fi)) 
H(B?{a),P (a)) ^ H(B?(P), P (j3)) 



t /la t fyfl 

H(L(a),L (a)) ^ H(L(P), L (p)). (6) 



We have already shown that the two vertical maps in the upper square are isomor- 
phisms; this was the content of the Power Cell Lemma in Section|53] To show that 
the vertical maps in the lower square are isomorphisms requires a bit more work. 
We make use of the following lemma, proven in (3). 

Lemma C.l (General Nerve Subdivision Lemma (GNSL)) LetC be the collec- 
tion of maximal cells of a CW complex, each a convex set in R fe . Define f : 
|Sd7V| — > UC by piecewise linear interpolation of its values at the vertices. If 
/(<t) is contained in the intersection of the cells that correspond to the vertices of 
a, for each simplex a 6 N, then f is a homotopy equivalence. 

The vertical isomorphisms in the bottom square then follow from this next lemma, 
where we may of course replace a with f3 if we wish. 

Lemma C.2 (Nerve Subdivision Lemma) Define h — h a : |SdL(a)| — ► Bp (a) 
on the vertices a of Sd L(a) by the formula 

h a (o) — arg min dfjtx) ~ d 2 (x), 
x£V<*nu a nB T (p) 

and extend it by linear interpolation. Then h a is a homotopy equivalence of pairs 
from (\Sd L(a)\,\SdL {a)\) to (B^ (a), P (a)). 

PROOF. By construction, h(a) is contained in the intersection of the cells that 
correspond to the vertices of a. By the GNSL then, h is a homotopy equivalence. 

Now we need to prove that the restriction of h to Sd Lo(a) is also a homotopy 
equivalence. Let a £ Lo(a) and put h(a) — z. For purposes of contradiction, 
suppose z Po(a). This means that z £ int P(a), by definition, and hence 
du(z) 2 ~d p (z) 2 > a 2 -r 2 . 

Now choose some z' £ V a n Po, which must exist since a G Lo(a). Then by 
definition we have d p (z') 2 — r 2 > du(z') 2 — a 2 , or d u (z') 2 —d p (z') 2 < a 2 — r 2 . 
Combining the above inequalities, we have du{z') 2 — d p (z') 2 < a 2 — r 2 < 
du(z) 2 — d p (z) 2 , which contradicts the assumption that h(a) — z. We conclude 
that z G Vo- PI Po(ct). Applying the GNSL once more finishes the proof. Ell 



To show that the lower square commutes, we put e = ja°h a ande' = hpog^, 
and we consider the map H : \L(a) \ x [0, 1] — > U a nB r (p) defined by H(x, t) — 
h at ° 9a* ( x )> where ctt = (1 — i)a + tfi. Since the maps g and j are inclusions 
and the maps h vary continuously with a, H isa homotopy between e and e'. This 
implies that the induced homomorphisms between the corresponding homology 
groups are the same, e* = e» . 
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C.2 Top Face 

We detach the top face of Figure[2T] drawing it in diagram[7] As before, we prove 
that all vertical maps are isomorphisms. The commutativity of the two smaller 
squares follows from nearly identical arguments to the ones used for the bottom 
face. 

H(B£(a),0B£(a)) -> H (Bp q (/?), dBp q (/?) ) 
H(fl£(a),Z (a)) H(B^G9),ZoG9)) 

t fta t h'g 

H(K(a),K (a)) H(KQ3), A (/?)) (7) 

The Intersection Power Cell Lemma tells us that the vertical maps in the top square 
are isomorphisms. As promised, we give the proof of this lemma here, repeating 
the statement for completeness. 

Lemma C.3 (Intersection Power Cell Lemma) Assume B r (p)C\B r {q)— Zq(q) ^ 
0. The identity i' on Bp q (a) is a homotopy equivalence of pairs between (Bp q (a) , dB, 
and(B^(a),Z (a)). 

PROOF. It suffices to show that the restriction of the identity, i' — i' a : dp q (a) — > 
Zo(a), is a homotopy equivalence. To do this, we first define a retraction j : 
Zo(a) — > dp q (a) as follows. Fix a point y G int Z(a), recalling that this set is 
nonempty by assumption. For each point x G Zo(a), we consider the unique ray 
starting at y and passing through x, and we let x' = j(x) denote its intersection 
with d(B r (p) n B r (q)). Note that x' G Zia) C U(a), and so j is certainly 
well-defined. That j is a retraction, meaning j o i' is the identity on dBp q (a), is 
obvious. On the other hand, the map 

A : Z (a) x [0,1] x Z (a) 

defined by A(is, t) = (1 — t)x + tx' is a homotopy between i' o j and the identity 
map on Zq (a), and so the claim follows. EH 

To prove that the vertical maps in the lower square are isomorphisms, we again 
make use of the GNSL. 

Lemma C.4 (Intersection Nerve Subdivision Lemma (INSL)) Define h! = h' a : 
|Sd K(a)\ -> Bp q (a) by setting 

h' a (a) — arg min min{dy (x) — dl(x), dfjtx) —- d 2 „(x)\, 

xevnu a nB r (p)nB T (q) 

where a is the barycentre of a G Ka, and then extending by linear interpoloation. 
Then h' is a homotopy equivalence of pairs between ( | Sd K(a) \ , | Sd Ko (a) | ) and 
(B«(a),Z (a)). 

PROOF. The proof is quite similar to that of the NSL. By construction, h'(o) is 
contained in the intersection of the cells that correspond to the vertices of cr, and 
so we need only prove that the restriction of h' to the barycentric subdivision of 
Ko(a) is also a homotopy equivalence. Let o G Aq(q) and put h'(a) = z. 

Supposes ^ Zo(a), and thus z G int Z(a). By definition then, d p (z) 2 — r 2 < 
du(z) 2 — a 2 and d q (z) 2 — r 2 < du(z) 2 — a 2 . In other words, mm{d 2 I (x) — 
d 2 p (x),d 2 u (x)~d 2 q (x)}>a 2 -r 2 . 



36 



Choose some point z £ V CT n Zo(a). Then one of the following inequalities 
must hold: d v {z') 2 -r 2 > du(z') 2 - a 2 , or d q {z') 2 ~ r 2 > d v {z'f -a 2 . That 
is, min{d^(z') - d 2 p {z),d 2 v {z) - c^(z')} < Q 2 - r 2 . 

Therefore, combining both inequalities, min{d^(2:')— d 2 (z'), d 2 J (z')—d 2 (z')} < 
a 2 — r 2 < min{d 2 J (z) — dp(z), d 2 J {z) — d 2 (z)}, which contradicts the definition 
of z. □ 



C.3 Left and Right Faces 

We now come to the final and most complicated part of the correctness proof, 
involving the left face (diagram [8]l of the double-cube; of course, everything we 
prove here will also hold for the right face. We have already established that all 
vertical maps are isomorphisms, and now must show that both squares commute. 

V\(Bp , dBp) — > V\(Bp q , dBp q ) 

H(B^,Po) — > H(Bp q ,Zo) 
t h, t K 

H(L,L ) ^ H(K,K ). (8) 

The top square will in fact commute even on the chain level. The bottom square is 
a little more complicated, and we start by addressing this first. 

In diagram[9] this bottom square has been expanded into two smaller squares 
of chain groups connected by chain maps. We show that the lower of these squares 
commutes on the chain level, and that the two choices of path across the upper 
square are connected by a chain homotopy. 

C(Bp , Po) C(Bp q , Zq) 
t h# t h' # 

C(|SdL|,|SdL |) A C(|SdJr|,|Sdi£:o|). 

C(SdL,SdL ) A C(SdK,SdK ). (9) 
C.3.1 Map Details 

First we need to discuss two of the horizontal chain maps from diagram[9]in more 
explicit detail. 

Upper map. We analyze the effect of j' on an arbitrary linear singular simplex 
oj : A p — y Bp , where uj = l(ao, a p ) for some points at in Euclidean space. 
The analysis can be broken up into three main cases: 

(A.l) cj(A p ) C B v q : Then j' maps ui through unchanged, meaning: 

[u : A p -»• Bp] ^Ilj:A p ^ Bp q ]. 
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Figure 22: Left: map j' for a linear singular simplex l(ao, Oi, 02). Right: map /' for a simplicial 
linear singular simplex l(vo, Vi,V2). 




Figure 23: Map j' for a linear singular simplex that requires barycentric subdivision. In this 
illustrated example, all four shaded regions are the images of the four singular simplexes in the 
first barycentric subdivision which are _4'-small and have their images in X — V. Their formal 
sum gives a singular chain in X — V. Their retraction result in a singular chain in B pq . 



From now on we simplify notation by omitting the domain and range of the 
singular simplex, writing instead: u) 



(A.2) oj(Ap) n Bg = 0: Then 10 ^ 0. 

(A.3) lj(A p ) £ andcj(A p ) n B^ ^ 0: here we have two sub-cases: 

(A.3. a) w is A' -small: This implies that ui(A p ) C X — V. Map j' can be 
interpreted as a retraction. That is, letting T = cj(A p ), S = ui(A p ) (~l 
Bq and R = u(A p ) n we define r : T -> 5" by: for x G 5, 

r(x) = x; for x G T — S, r(x) = x', where x' G i?, as shown in 

the left of Figure 
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Then lj r, where r : A p — > B pq is defined 

by: for e G A p where w(e) G S, r(e) = u){e)\ otherwise for e G A p 

where w(e) ^ S 1 , r(e) = ro w(e). 

(A.3.b) oj is not ^4'-small: We barycentrically subdivide uj enough times m 

until sd m uj is a ^l'-small singular chain. Then each _4'-small singular 

simplex in sd m u) that has its image in X — V follows the pattern of 

(A.3. a), resulting in a singular simplex : A p — > B pq . In the end we 

./ _ 
have, uj h^-> c t , where c T is the singular chain, c T — r 4 . This is 

shown in Figure [23] 



Middle map. We now describe the action of /' on an arbitrary simplicial linear 
singular simplex. Let 8 : A p — > |SdLj be such a simplex with 8 — r/(a) — 
1(vq, v p ), for some simplex a = [vq, v p ] G Sd-K". As above, we have three 
cases to consider:: 
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(B.l) 5(A P ) C \SdK\: then 5 A <5. 
(B.2) 5(A P ) n |Sdif| = 0: 5 A 0. 

(B.3) 5(A P ) £ \SdK\ and 5(A P ) n |Sd-K"| + 0: From Lemma 
know that (<5(A p )n|SdA|) C | Sd K \ , and so 5 A 0. 



C.5 



below, we 



Lemma C.5 Given a simplex a £ L, if a K and there exists r < a such that 
t G A, then t G Kq. 

Proof. Suppose there exists cj < r such that uj G A — Kq. This implies that V u 
is completely contained in Bp q . Since V is the intersection of V" with the partial 
Voronoi cells of vertices in a that are not in cj, then V a should be completely 
contained in Bp q . This means that a is in A, which leads to a contradiction. EH 



C.3.2 Lower Square 

As promised, we now show that the lower square in diagram[9]commutes. Choose 
an arbitrary a — [vo, v p ] G SdL, where each Vi is a barycenter of some sim- 
plex a' in L; as always, we assume that that the vertices are ordered by increasing 
dimension of their defining simplices. We have two cases: 

(C.l) (j G SdK: by definition, ry(cr) = l(vo, ...,w P ) has its image in |Sd A|, and 
f is the identify map, that is, a i— > a i— > ^(ct). Meanwhile, by case (B.l), 
a i— > 77(0-). Therefore (r\ o /)(<j) = /' o r7(cr). 

(C.2) cr ^ Sd A : then u^O^O.On the other had, since a & Sd A, we know 
that the image of 8 = r/(a) = /(no, cannot be entirely contained 

within jSd A"j. There are then two sub-cases to consider: 

(C.2.a) S(A P ) n Sd A| = : this is case (B.2). We have a A 5 A 0. 
(C.2.b) (5(A p )n!SdA'| C|SdA | : this is case (B.3). We have a A 8 A 0. 

C.3.3 Upper Square 

Finally, we show that the upper square in diagram[9]commutes up to chain homo- 
topy; that is, we will construct a chain homotopy D between the two chain maps 
e = j' o /i # and e' = h'# o /'. This will of course imply that e, = e' t ; in other 
words, that the induced homology diagram commutes. For clarity, we zoom in on 
diagram[9]and draw the relevant portion below as diagram [TO) 

C(Bp , P ) C(Bp q , Z ) 
t fc# t h' # 

C(|SdL|,|SdL |) A C(|SdA|, |SdA |). (10) 

For notational ease ,we set X = |SdL| and Y = Bp q . To construct D, we will 
define for each p a chain map F p : C P (X x /) — > C P (Y), and then we will set 
Dp = Ap+i o G p , where G p : C V (X x I) —t C p +i(X x I) is given by Lemma 
IColbelow. 
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X x I 



Y 



Figure 24: Case (D.l.a): illustration of F. 

Construction of F. Let n : X x I —> X be projection on the first factor, 
and fix an arbitrary simplicial linear singular simplex n : A v — > X x /. Then 
7t#(k) — 5 = 1(&q, (T p ), for some simplex a = [&o, a p ] in Sd L. We define 
F in stages, based on properties of 8, as follows. 

(D.l) 8(A P ) C |Sdif|: following the e'-path and case (B.l), we have 8 A 

8 i > t . On the other hand, following the e-path results in 8 i > oj. We 

now have three sub-cases, based on varying properties of oj: 



(D.l.a) u)(Ap) C : following the e-path and case (A.l). we have, 8 




oj r, where r = oj except for differing range. We then can define 
F(k) — t, where l : A p — > Y is given by: for every e g A p , where 
«(e) = (i,t)£Xx J, t(e) = (1 - i) r ( e ) + tr'(e). This formula is 
illustrated in Figure [24] 
(D.l.b) cj(Ap) n Bq 7 = 0: This is case (A.2). We branch further as follows: 

(i) 8(A P ) C |SdiCo|: Following the e'-path, i Ao ^> 0. Simi- 
larly, following the e-path, 8 i— ^» oj ^ 0. We define F(k) = 0. 

(ii) (5(A P ) ^ |SdJfo|: this is not possible. Suppose it were. This 
implies that there exists at least one vertex &i of a such that V CTi n 
Bp q ^ and V" 7 ' n Z = 0. This means that is completely 
contained in B r (q). Therefore h(&i) is contained in B r (q), which 
contradicts our assumption. 

(D.l.c) oj(Ap) B% and oj(A p ) fl B, 11 / ft we must consider two further 
sub-cases. 

(i) oj is _4'-small: this is case (A. 3. a), and we define F(k) similarly 
to case (D.l.a). 

^# j' 

(ii) oj is not A -small: this is case (A.3.b). Then 8 i > oj i — ► c T , 

where c T = ^ r< for some collection of r, : A p — > _B^. We now 
define F(k) = c t , where c t = ^ tj and each singular simplex 
Li : A p — > V. is defined as follows. Let m be the smallest integer 
such that sd m oj is .A' -small. For each singular simplex 7j in c T , 
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Y 

X x I 



Figure 25: Case (D.l.c): illustration of F. Left: the dark shaded region is the minimal carrier of 
Si and ft;. Right: the shaded regions from top to bottom are the minimal carriers of r', c, and c T 
respectively; the dark shaded regions from top to bottom are the minimal carriers of t[, and 
Ti, respectively. For simplicity, we illustrate the minimal carrier of the singular chain c r as the 
union of the minimal carriers of its simplexes before their retraction. 



there exists a singular simplex oji in sd m uj such that j' = tj. 
For each such LOi, there exists a singular simplex Si in sd m S such 
that h#(Si) = U)i. In other words, for each n in c T , there exist 

Si in sd m 8, such that following the e-path, Si i > uii i — ► Ti. 

Meanwhile, for each such Si, following the e'-path gives Si >—¥ 

Si I > Ti . 

On the other hand, for each such Si, there exists a corresponding 
ft, in sd m n, such that Si = n(ni). We now define tj for each 
such Ki. For all e G A p where fCi(e) = (x,t) 6 X x /, t,(e) = 



(1 — i)Ti( e ) + ( e )- This is illustrated in Figure 25 
(D.2) <5(A P ) ^ |Sdif| : we again have two sub-cases: 

(D.2.a) S(A P ) n jSdA"! =0 : following the e'-path and case (B.2), 8 J-> 

Oi > 0. Since 5(A p )n|Sd K\ — 0, this implies that its corresponding 

a £ SdK. That is, for all a u V ai n B% q = 0, therefore all h(&i) lie 
outside of B r (q). Let w = h# (8) = /i o 5. This means oj has its image 

outside of B^. Then following the e-path, 8 i— ^> cj 0. We define 
F(re) = 0. 

(D.2.b) (<5(Aj,) n |Sdif|) C |SdA' | : following the e'-path and case (B. 3), 

/' ^# 

we have, cr i — > i > 0. On the other hand, let w = h#(8) — ho 8. 



h 



Then oj(A p ) C Po and so following the e-path give a h— > u; 
We define F(k) = 0. 
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Figure 26: Illustration of G. 



Construction of D. To define our chain homotopy D, we first need the follow- 
ing lemma: 

Lemma C.6 ( H271 . page 171) There exists for each space X and each non-negative 
integer p, a homomorphism G p : C P (X) — > C p +i(X X /), having the following 
property: if 8 : A p — > X is a singular simplex, then dGS+GdS = j#(5) + i#(8), 
where the map i : X — > X X I carries x to (x, 0), and the map j : X — > X X / 
carries x to (x, 1). 

This homomorphism is illustrated intuitively in Figure [26] where 5 x Id carries 
a singular p + 1 chain that fills up the entire prism A p x / to a singular chain 
on X x I, and the maps i,j : A p — s> A p x / carry each x to (x, 0) and (x, 1) 
respectively. Then, as promised, we set D p — F p +i o G p . To show that D is a 
chain homotopy between e and e', we calculate 

dD = d(FG) 
= FdG 

= F(j# + G3) 
= Fj # + Fi # + FGd 
= Fj # + Fi # + DO 

Hence we need only show that Fj# — e' and Fi# = e to complete the argument. 
In the case when F(k) is defined to be 0, the corresponding e(5) and e'(S) are also 
0, so this is no problem. In the case when F(k) is not defined to be 0, as shown in 



Figure 24 and Figure 25 Fj#(8) = e'(S), and Fi#(5) = e(5). This concludes 
the proof that the upper square in diagram[9]commutes up to chain homotopy, and 
thus that the bottom square of diagram[8]commutes. 

C.3.4 Top Square of Diagram|8] 

As promised above, we now prove that the top square of diagram [8] commutes, 
which will complete the proof that the left face of Figure [2T| commutes. In fact, 
the top square commutes on the chain level, which we draw directly below. 

C(B P ,8B P ) A C(B pq ,dB pq ) 
C{B p ,Po) C(B pq , Zo) 



Setting e = j' o i# and e' — o j, we show, once again via an exhaustive case 
analysis, that e = e'. 
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First we need to understand the map j for a linear singular simplex. The inter- 
pretation of j is almost the same as that of j' (case (A)). More specifically, we let 
lj : A p — > Bp be an arbitrary linear singular simplex. There are three cases: 

(E.l) uj(A p ) C Bq : then u & u. 

(E.2) u}(Ap) n B^ — : then u^O. 

(E.3) oj(Ap) £ B^ andcj(Ap) n B^ / : We have two sub-cases: 

(E.3.a) ui is ,4-small: then w ^> 7, where 7 : A p — > is defined via the 
retraction-type arguments above. 

(E.3.b) oj is not ,4-small: then oj 1 — > c 7 , where c 7 = 7i> with each 7* : 
A p — > described by the subdivision and retraction arguments we 
have already given. 

To complete the proof, we fix an arbitrary singular simplex S : A p — > Bp , and 
again argue by cases. 

(F.l) <5(Ap) C Bg : exploiting the analysis above, we note that following the e'- 

path results in 8 4> 8 1— ^> S, while following the e-path gives 8 h^> (5 4-> 5, 
as needed. 

(F.2) 5(A P ) n Bq — : here both paths result in 0. 

(F.3) S(Ap) (j- B^ and 8(A P ) n -B^ 7 7^ : here we must analyze two sub-cases: 

(F.3.a) 8 is ,4-small: this implies that S(A P ) C. X — V. Following the e-path 

gives. (5^7 1— ^> 7. On the other hand, 8 is also _4'-small, since A 
and A' share the element X — V, and hence the e' path 

But really the fact that X — V is part of .4 and A' means that r and 7 
follow the same retraction, and thus 7 = r. 
(F.3.b) 5 is not ,4-small: the analysis here is the same as the last case, with 
some words about subdivision added. 

C.4 Finale 

We are now ready to finish the proof of Theorem |5.1| which boils down to verifying 
that diagram[5]commutes, with the vertical maps being isomorphisms. That is, 

. . . — xker (f>a — > kcr 

. . . — Kker tpa ker ipp — > .... 

But this is now just easy diagram-chasing. Commutativity of diagram [5] follows 
directly from the commutativity of the bottom face of the double-cube in Figure 
|2 1 1 and the leftmost (rightmost) vertical isomorphism derives from our statements 
about the left (right) face of the double-cube. The commutativity of the top face 
implies that the cokernel analogue to diagram[5]commutes, after a little more alge- 
bra which we omit. 
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D Proof of Theorem |33] 



In this Appendix, we give a proof for Theorem 1 3. 3 1 First we need a technical 
lemma involving some simple algebraic topology. 

D.l Absolute Homology Modules 

Recall from before that <r(p, r) is the feature size of the relative homology persis- 
tence module {H(Bp, dBp )}. On the other hand, the same thickening process also 
defines two absolute homology persistence modules, {H(Bp)} and {H(dBp)}. 
We let <Ji(p, r) and <Jb{p, r) denote the feature sizes of these modules. Similarly, 
we define Ui(p,q,r) and a b (p,q,r), respectively, to be the feature sizes of the 
absolute homology persistence modules {H(Bp q )} and {H(dBp q )}. 

Theorem D.l (Relative/Absolute Lemma) The feature size of each relative mod- 
ule is at least the minimum of those of its two associated absolute modules: 

a(p,r) > xmn{cri(p,r),a b (p,r)}, 
v(p,q,r) > mm{o-i(p,q,r),a b (p,q y r)}. 

PROOF. We prove the first equality; the second can then be proven with only 
minor notational adjustment. For any two non-negative reals a < j3, and for each 
homological dimension i > 0, consider the following commutative diagram: 

Hi{dBj;(a)) ->■ Hi(Bf(a)) ->■ H t (B^(a),dBf(a)) -> H«_i(0B*(a)) Hi_i(Bj(a)) 

Hi(dB$(0)) Hi(Bp(f3)) Hi(Bp (/3) , dBp (ft)) H t -i(dB${p)) H t -i{B%(J3)) 

(11) 

where the vertical maps are induced by the inclusion X a <— > Xp and the two 
rows come from the long exact sequences of the pairs (Bp (a), d Bp (a)) and 

(BjO9),0BjO9))(Ea)- 

Suppose that the middle vertical map fails to be an isomorphism. Then the 

Five-Lemma ( 1271 . p. 140) tells us that at least one of the other four vertical maps 

will also fail to be an isomorphism. In other words, any change within the relative 

module must be accompanied by a simultaneous change in at least one of the two 

absolute modules. The inequality follows. EQ 

This theorem together with the definition of p(p, q, r) implies the following 
inequality 

P(p,q,r) > mm{a l (p,r),a b (p,r),a i (p,q,r),a b (p,q,r)}. (12) 

D.2 Proof 

To prove Theorem |3.3| we will further lower bound the tr-parameters above. This 
is accomplished via one more lemma. 
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Lemma D.l (Deformation Lemmas) The following four claims all hold for ev- 
ery small enough 8 > 0. In each of the claims, the homotopy equivalence is given 
by a deformation retraction: 



Va < mm{r(p,r),i 1 (p,r)},(X a nB r {p)) ~ (X s n B r (p)), 

Va < mm{T (p,q,r),r]{ P ,r)},{X a r\dB T (p)) ~ (X 4 ndB r (p)), 

Va < min(r(p, 9 ,r),t)(j), 9 ,r)}, (X a n S r (p) n B r (?)) ^ (X 4 n B r (p) n Sr(g)), 

Va < min{r (p, g, r), v {p, q, r)}, (X a n 9(-Br(p) n Br(g))) ^ (Xa n d{B r (p) n B r («))). 

PROOF. All four claims follow from Stratified Morse Theory |19|. We prove 
only the first claim; the other three can be proven with only slight modifications. 
Consider the stratification of B r (p) with singular set E = M(p, r) U dB r (p) and 
whatever further decomposition of E is needed. Setting d = d%\B r (p) : B r (p) — > 
R, we note that the sets X a n B r (p) are simply the sublevel sets of d for various 
parameters a. Generically, d will be a Stratified Morse function on B r (p) with 
its above stratification. Consider the set H of all critical points of d which have 
positive d- value. 

We claim H C (A4(p, r) U G(p, r)) : to see this, we suppose y £ H and we 
assume first that y is in the interior of B r (p). Then y is also a critical point of the 
globally defined function d%, and since d(x) — dx(x) > 0, we know that y € M. 
Since y is also in B r (p) by assumption, we know in fact that y 6 A4(p,r). On 
the other hand, suppose that y £ dB r (p); we can also assume that y £ M(p, r) 
or we are already done. Then by definition y is a critical point of the restriction of 
dji to dB r (p). Since the gradient of this latter function is simply the projection of 
Vdx onto dB r (p), we can conclude y £ G(p, r). 

In other words, if a < {r(p, r), r)(p, r)}, then (X a H B r (p)) n i? = 0, and 
hence the interval [5, a] contains no critical values of d. The claim then follows 
from the first fundamental theorem of Stratified Morse Theory 1 19). E3 

Finally, we finish the proof of Theorem |3.3| which we restate here for conve- 
nience. 

Theorem D.2 (Geometric lower bound) If we define 

7 = t(Pj <?. r ) = min{r(p, r),r(p, q, r), rj(p, r), r)(p, q,r)}, 
then p(p,q,r) > j(p,q,r). 
Proof. 

Note that r(p, r) < To(p, r) and r(p, q, r) < i~o(p, q, r) so we need not con- 
sider To(p, r) and To(p, q, r). 

Recall Oi{p, r) and <7f,(p, r) were defined to be the feature sizes of the persis- 
tence modules {H(Bp(a))} and {H(dBp (a))}, respectively. 

By the first and second of the Deformation Lemmas the following holds 

Oi(p,r),a b (p,r) > min{r(p, r), 77 (p, r)}. 
For the same reason 

dip, q, r),a b (p, q, r) > min{r(p, q, r), r/(p, q,r)}. 
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These inequalities, together with \\2\ 

p{p,Q,r) > min{a t (p,r),a b (p,r),ai(p,q,r),a b (p,q,r)}, 
prove the theorem, p(p, q, r) > 7(p, q, r). EH 
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